Is Cp an empirical Bayes method for smoothing parameter choice?
نویسنده
چکیده
The Cp selection criterion is a popular method to choose the smoothing parameter in spline regression. Another widely used method is the generalized maximum likelihood (GML) derived from a normal-theory empirical Bayes framework. These two seemingly unrelated methods, have been shown in Efron (Ann. Statist. 29 (2001) 470) and Kou and Efron (J. Amer. Statist. Assoc. 97 (2002) 766) to be actually closely connected. Because of this close relationship, the current paper studies whether Cp could also have an empirical Bayes interpretation for smoothing splines as GML does. It is shown that this is not possible. In addition, necessary conditions for a selection criterion to have an empirical Bayes interpretation are given, using which it is shown that a large class of selection criteria, including Akaike information criterion, Bayesian information criterion and Stein’s unbiased risk estimate, does not possess an empirical Bayes explanation. c © 2003 Elsevier B.V. All rights reserved.
منابع مشابه
A new adaptive exponential smoothing method for non-stationary time series with level shifts
Simple exponential smoothing (SES) methods are the most commonly used methods in forecasting and time series analysis. However, they are generally insensitive to non-stationary structural events such as level shifts, ramp shifts, and spikes or impulses. Similar to that of outliers in stationary time series, these non-stationary events will lead to increased level of errors in the forecasting pr...
متن کاملReversing and Smoothing the Multinomial Naive Bayes Text Classifier
Abstract. The naive Bayes text classifier has long been a core technique in information retrieval and, more recently, it has emerged as a focus of research itself in machine learning. This paper is concerned with the naive Bayes text classifier in its multinomial model instantiation. This model and an “equivalent” reversed version proposed here are interpreted under the statistical framework of...
متن کاملبه کارگیری بیز تجربی در تهیه نقشه جغرافیایی بروز بیماری سل در استان مازندران طی سالهای 90-1384
Background and purpose: Due to the increasing information about illnesses and deaths, classified map is of appropriate methods for analyzing this type of data. Standardized infection rates are commonly used in disease mapping but had many defects. This study aimed to compare the Poisson regression models and empirical Bayes models to prepare geographical map of tuberculosis incidence in Mazanda...
متن کاملSmoothers and the Cp, Generalized Maximum Likelihood, and Extended Exponential Criteria: A Geometric Approach
Nonparametric regression, often called smoothing, is a widely used data analysis method. The use of a smoother requires the choice of a smoothing parameter that by balancing delity and roughness controls how much smoothing is done. Two popular selection criteria for choosing the smoothing parameter are Cp and generalized maximum likelihood (GML). Each of these has its own problems. For Cp , t...
متن کاملSmoothing spline Gaussian regression: more scalable computation via efficient approximation
Smoothing splines via the penalized least squares method provide versatile and effective nonparametric models for regression with Gaussian responses. The computation of smoothing splines is generally of the order O.n3/, n being the sample size, which severely limits its practical applicability. We study more scalable computation of smoothing spline regression via certain low dimensional approxi...
متن کامل